CDS

Accession Number TCMCG007C39222
gbkey CDS
Protein Id XP_009151195.2
Location complement(join(23106155..23106319,23106441..23106719,23106828..23106905,23107010..23107123,23107226..23107414,23107490..23107766,23107833..23108011,23108091..23108405,23108587..23108822,23108906..23108977,23109150..23109387,23109472..23109710,23109879..23110053,23110144..23110221,23110335..23110400,23110478..23110576,23110681..23110802,23111112..23111394))
Gene LOC103874522
GeneID 103874522
Organism Brassica rapa

Protein

Length 1067aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA249065
db_source XM_009152947.3
Definition protein ALWAYS EARLY 1 isoform X1 [Brassica rapa]

EGGNOG-MAPPER Annotation

COG_category K
Description binding, transcription factor
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000        [VIEW IN KEGG]
ko00001        [VIEW IN KEGG]
KEGG_ko ko:K21773        [VIEW IN KEGG]
EC -
KEGG_Pathway ko04218        [VIEW IN KEGG]
map04218        [VIEW IN KEGG]
GOs -

Sequence

CDS:  
ATGAAAAGGAGAATGCCTATTGGCTCTCTCTCTCTCTCAACTCTGGAGTCGTCGTCCTTTTATTCAAGTTTCGCGGCAACGCCTCTCTCCCCTCTCTCTCTCTGTCTCTTCCTTCATCTCCCAAATAATCTTTTTTTCTTTTTCAATCTCTCTCTCTCTCTTTTAAAAAACTTCTTTCGAAATTTCTCACGATCTCTCCAAGAAGCTTCTCAAAAGCTTTTCTCTCATCCGAGCTCTACTCCGTGTGGTTCAACGCAGATTCTCTTTGAGCTGCGTTACTCGGGTGTTGAATTGGATTGGGGAATTTGGTTGAAGGGAGAAGAGGATATGGCGCCGAGGAGTGTGAACAAGCGCGTCACCAAAGAAGCCTCCTCCTCTCCAGATATAGACAAAGTCAAGCAACGTAAGAAGAAGTTGACTGACAAGTTGGGACCTCAGTGGACGAAAGGTGAGCTTGAGCGTTTCTATGATGCCTATCGGAAGCACGGCAGAGACTGGAAAAAGGTAGCTGCTTCAGTGCGGAGTAACCGGTCTGCTGATATGGTGGAAGCCCTTTTTTCTATGAATCGGGCATATTTGTCCCTGCCCGAGGGAACTGCGTCTGTAGCTGGTCTCATTGCAATGATGACTGACCATTACAGCGTCATAGAGGAGAGCGAAAGTGAAGGAGAAGGCCACGGTGCTTCTGGAGTATCGAGGAAATATCAGAAGCGCAAACGTGCTAAAGTTCCGCCTAGTGATGTTCGAGAAGAAGTTATCTCACCACATTCAATTGCATCAACGGAAGGATGCCTCTCATTTTTGAAGACGACACAAGTTTATGGAAGGGAGCGACGTGCCACTGGTAAACGTACACCTCGGTTTCTTGTACCAAGTGCAGACCAGAGGGATGATACACAAGGTTCAACGCCACCAAATAAAAGAGCCAAGAAACAACTTGAGGCTGATGCTGATGATGATGATGATGATGATGATATACTAGCATTAGCATTGGCAAATGCATCGACAAGGGTTGGAGGGTCTCCATATAGAAGACCAGACACCACGCCAAATGTGAAAATGTCACAAGCTAAGGAAGCTCAATCCAAGCACCAAGCTAGCTCCATGTCTAGGAATGTGGTGAGAATAAGCCGAGATAGGAGGCACATTAAGAGATCTCCAGATAGAGATGGTGCCTTGTTGATGGATATAGAAGGGGTTGGTAACGCGGAGGTTCCTCGGAAGGAGAAAAATGTCAGAACTGTAGAAGCAGAAGGAGATACTTCTGATGATAGCGGAGAAGCATGCAACGCCCCAAGAGATGGATTAGAAGCTCTGCATGCATTGGCTGACTTGTCAGCTTTACTGACTCCGGGTGGTTTGATGGAATCAGAATCATCTGCGGAGTTGAAAGAAGAAAGAGTAGCTAACACTCGGGAAACCGTATCCAGTAGCCATACCAGAGAAAAGGCAAAACAAGCAGGACGAGAAGACCATAGTGTCCTACATGTAATTTCAGCTGCTGATAGTAGAAAACCGAAGTCTGCGCAGGAACTCGTTGATGGTAATGCTGTTCCCATAGGGGAACTTGACACTTCAAGAAGAAAACGTAAACCTCTACATAATAAGGAATCGGCTGAAGATGATAATTTGAAGACTTCGATCAACGCCAGACGTGTTGGTCAAGGTCCAGCAAAGCAGCAGAAAACAGCAAAGACATCGGAAGAATCTTGTTCAACTAGTGATAAGAAAATAACAAGACCAAATGAAGCAGTGTCAGCTACACAAGTTTCAGGTTCGGGTCCAGCGAGCTTGCCGCAGAAACCACCAAACAGGCGTAAGATTAGTTTGAAGAAAAGTTTACAAGAAAGAACTAAATCTTCTGAAACCACTCACAACAAGTCACATAGTTACGAAATAGATCCAGAACATGAGCTACTAAAGGACAAGGTTTCGACTTGTCTATCACATCCATTGGTACGTCGAAGGTGCATATTCGAATGGTTCTATAGTGCCATTGACTATCCGTGGTTTGCAAAGATGGAGTTCGTTGATTATCTGAATCACGTGGGTCTTGGTCACGTTCCAAGACTTACTCGTCTTGAATGGAGCGTCATTAAAAGTTCTCTTGGCAGACCTCGGAGATTCTCTGAGAGATTCGTACATGAAGAGCGGGATAAACTCGAACAATATCGTGAATCTGTGAGAAAGCAATACGCAGAGCTACGTGCAGGTGCTAGAGAAGTGCTTCATACAGATTTGGCCCAGCCTTTATCAGTTGGGAATAGAGTCATTGCCATCCATCCAAAAACACGGGAGATTCGTGATGGCAAGATTCTTACTATTGATCATAACAAGTACAACGTTCTGTTTGATGAGTTGGGAGTCGACGTGGTTATGGACATTGATTGCATGCCTTTAAATCCGTTAGAATATACTCCTGACGGTCTAAGGAGGCAAATGGATAACTGCTTGACTGTATGCGGAGAAGCACAGGTTAGGAAACACCCAAGCTCTGATGCATCTGTTCTGTTCACTCCTTCCGAGCTTGAAAATGTCGAATTTTCTATGAGTCATACTAAGAAAGAGGATGATAGAGACAGGCGAGTCACTACTGATCAAACGTATAACACAGCCAATCGCAAAGAAAGAAGAGATGAAATTCAACAAGATCTGATGCTGGAGCGTACTTCAGATGCACAGGAAATGGAGCCAGAAATGCTTGGAATTGTCAGTGGTTCAAGGTCAATAGCACAAGCTATGGTGGATGCAGCTATAAAGGCTGCATCTTCTGTGATGGATGACAAAGACGCAGGGAAGATGGTCATACAAGCTTTAGACTCAATCGGCGAACATCATCAGCCATTAGATAACTCTATAGTGTCTCGTATGAAGCATCAAGACCAAGCCAATGGCAGCTTGGATCATCATCATCAAAACCGGTCTCCCTCAAACACAGGAGAAGCCATGAATGAAGGATTGATGGGATCAGGGAAAAACGAAACGCAAATGGATTCAGAGCTGATCAGCTCTTGTGTTGCAACTTGGCTCATGATTCAGAAGTGCACAGAGAAGCAGTACCCACCAGGGGACGTGGCTCAGGTGATGGAGACAGCAGTGAGTAGCTTGCAGCCGCGGTGTCCGCAGAACATGCCGATCTACAGAGAAATACAGACTTGCATGGGATGGATCAAGAATCAAATCATGGCTCTTGTCAAAACATGA
Protein:  
MKRRMPIGSLSLSTLESSSFYSSFAATPLSPLSLCLFLHLPNNLFFFFNLSLSLLKNFFRNFSRSLQEASQKLFSHPSSTPCGSTQILFELRYSGVELDWGIWLKGEEDMAPRSVNKRVTKEASSSPDIDKVKQRKKKLTDKLGPQWTKGELERFYDAYRKHGRDWKKVAASVRSNRSADMVEALFSMNRAYLSLPEGTASVAGLIAMMTDHYSVIEESESEGEGHGASGVSRKYQKRKRAKVPPSDVREEVISPHSIASTEGCLSFLKTTQVYGRERRATGKRTPRFLVPSADQRDDTQGSTPPNKRAKKQLEADADDDDDDDDILALALANASTRVGGSPYRRPDTTPNVKMSQAKEAQSKHQASSMSRNVVRISRDRRHIKRSPDRDGALLMDIEGVGNAEVPRKEKNVRTVEAEGDTSDDSGEACNAPRDGLEALHALADLSALLTPGGLMESESSAELKEERVANTRETVSSSHTREKAKQAGREDHSVLHVISAADSRKPKSAQELVDGNAVPIGELDTSRRKRKPLHNKESAEDDNLKTSINARRVGQGPAKQQKTAKTSEESCSTSDKKITRPNEAVSATQVSGSGPASLPQKPPNRRKISLKKSLQERTKSSETTHNKSHSYEIDPEHELLKDKVSTCLSHPLVRRRCIFEWFYSAIDYPWFAKMEFVDYLNHVGLGHVPRLTRLEWSVIKSSLGRPRRFSERFVHEERDKLEQYRESVRKQYAELRAGAREVLHTDLAQPLSVGNRVIAIHPKTREIRDGKILTIDHNKYNVLFDELGVDVVMDIDCMPLNPLEYTPDGLRRQMDNCLTVCGEAQVRKHPSSDASVLFTPSELENVEFSMSHTKKEDDRDRRVTTDQTYNTANRKERRDEIQQDLMLERTSDAQEMEPEMLGIVSGSRSIAQAMVDAAIKAASSVMDDKDAGKMVIQALDSIGEHHQPLDNSIVSRMKHQDQANGSLDHHHQNRSPSNTGEAMNEGLMGSGKNETQMDSELISSCVATWLMIQKCTEKQYPPGDVAQVMETAVSSLQPRCPQNMPIYREIQTCMGWIKNQIMALVKT